Longbo Huang

Best-of-Both-Worlds for Heavy-Tailed Markov Decision Processes

Feb 03, 2026

Reparameterization Flow Policy Optimization

Feb 03, 2026

Finite-time Convergence Analysis of Actor-Critic with Evolving Reward

Oct 14, 2025

Finite-Time Convergence Analysis of ODE-based Generative Models for Stochastic Interpolants

Aug 10, 2025

OM2P: Offline Multi-Agent Mean-Flow Policy

Aug 08, 2025

Reparameterization Proximal Policy Optimization

Aug 08, 2025

Proxy-Free GFlowNet

May 26, 2025

Continuous K-Max Bandits

Feb 19, 2025

Finite-Time Analysis of Discrete-Time Stochastic Interpolants

Feb 13, 2025

Few is More: Task-Efficient Skill-Discovery for Multi-Task Offline Multi-Agent Reinforcement Learning

Feb 13, 2025